Data Replication and Delay Balancing in Heterogeneous Disk Systems

Authors

  • Doron Rotem
  • Sridhar Seshadri
  • Luis M. Bernardo
Abstract

Declustering and replication are well-known techniques for improving the response time of queries in parallel disk environments. Because data replication incurs a penalty for updates, database designers must decide which part of the database to load on each disk and how these parts should be replicated. The problem becomes more complicated in heterogeneous environments, where disks have different speeds and capacities: intuitively, faster disks should be assigned a greater portion of the database to balance the delay in the system. We present analytic results and heuristics that provide “near” optimal solutions to the combined problem of finding declustering proportions and replication schemes in such environments. Our model takes into account disk characteristics such as speed and capacity, as well as access patterns to the data. Simulation results comparing different replication strategies in various environments are also provided.
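
As a rough illustration of the intuition that faster disks should hold a greater portion of the data, the following Python sketch splits a database across disks in proportion to their speeds and caps each share at the disk's capacity. It is not the model or heuristic from the paper: the function name `declustering_proportions`, the assumption of uniform access to all data, the absence of replication, and the treatment of per-disk delay as data size divided by speed are all assumptions made for this example.

```python
def declustering_proportions(speeds, capacities, db_size):
    """Hypothetical helper: split db_size across disks proportionally to speed.

    Assumes uniform access and no replication, so the delay of disk i is
    taken to be alloc[i] / speeds[i]. Disks whose proportional share would
    exceed their capacity are filled completely, and the rest of the data
    is redistributed over the remaining disks.
    """
    if sum(capacities) < db_size:
        raise ValueError("total capacity is smaller than the database")

    alloc = [0.0] * len(speeds)
    free = set(range(len(speeds)))   # disks that are not yet full
    remaining = float(db_size)       # data still to be placed

    while free:
        total_speed = sum(speeds[i] for i in free)
        # Disks whose speed-proportional share does not fit.
        full = [i for i in free
                if remaining * speeds[i] / total_speed > capacities[i]]
        if not full:
            # Every share fits: assign proportionally and stop.
            for i in free:
                alloc[i] = remaining * speeds[i] / total_speed
            break
        # Fill the saturated disks and redistribute what is left.
        for i in full:
            alloc[i] = float(capacities[i])
            remaining -= capacities[i]
            free.remove(i)
    return alloc


if __name__ == "__main__":
    print(declustering_proportions([2, 1, 1], [100, 100, 100], 100))
    print(declustering_proportions([2, 1, 1], [40, 100, 100], 100))
```

In the first call every disk ends up with the same size-to-speed ratio, which is the balanced-delay condition under these simplifying assumptions. In the second call the fastest disk is capacity-bound, so the two slower disks absorb the remainder equally.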

Similar resources

Improving Data Availability Using Combined Replication Strategy in Cloud Environment

As data-intensive applications in cloud computing grow day after day, data popularity in this environment becomes critical. Hence, to improve data availability and provide efficient access to popular data, replication algorithms are now widely used in distributed systems. However, most of them only replicate a static number of replicas on some chosen requesting sites and it is ob...

Maximizing Throughput in Replicated Disk Striping of Variable Bit-Rate Streams

In a system offering on-demand real-time streaming of media files, data striping across an array of disks can improve load balancing, allowing higher disk utilization and increased system throughput. However, it can also cause complete service disruption in the case of a disk failure. Reliability can be improved by adding data redundancy and reserving extra disk bandwidth during normal operatio...

A Dynamic Algorithm for Regulating the Concurrency of Business Processes

Business process management systems (BPMS) are complex information systems that are vital for competing in the global market and increasing economic productivity. Workload balancing of resources in BPMS is one of the challenges that researchers have long studied. Workload balancing of resources increases system stability, improves the efficiency of the resources, and enhances the quality of their pr...

Randomized Data Allocation for Real-time Disk I/O

Steven Berson (USC Information Sciences Institute, Marina del Rey, CA 90292); R. R. Muntz and W. R. Wong (Computer Science Department, UCLA, Los Angeles, CA 90024). Abstract: Continuous media such as video or audio from databases that are disk resident require real-time disk I/O support. Video on demand systems have been widely studied and most proposed designs take advantage of the (largely) predictable nat...

An Efficient Fault-tolerance Technique Using Check-pointing and Replication in Grids Using Data Logs

Grid computing systems are growing in importance with advances in network technology. Grids are composed of many geographically distributed resources, each having its own administrative domain. Grid computing involves decentralized, heterogeneous, geographically distributed resources that can work on a job together. Since the resource availability is dynamic in n...

Publication date: 1998